Exploring Compositional Data with the CoDa-Dendrogram
نویسندگان
چکیده
Abstract: Within the special geometry of the simplex, the sample space of compositional data, compositional orthonormal coordinates allow the application of any multivariate statistical approach. The search for meaningful coordinates has suggested balances (between two groups of parts)—based on a sequential binary partition of a D-part composition—and a representation in form of a CoDa-dendrogram. Projected samples are represented in a dendrogram-like graph showing: (a) the way of grouping parts; (b) the explanatory role of subcompositions generated in the partition process; (c) the decomposition of the variance; (d) the center and quantiles of each balance. The representation is useful for the interpretation of balances and to describe the sample in a single diagram independently of the number of parts. Also, samples of two or more populations, as well as several samples from the same population, can be represented in the same graph, as long as they have the same parts registered. The approach is illustrated with an example of food consumption in Europe.
منابع مشابه
Signal Interpretation in Hotelling’s T 2 Control Chart for Compositional Data
Nowadays, control of concentrations of elements is of crucial importance in industry. Concentrations are expressed in terms of proportions or percentages which means that they are compositional data (CoDa). CoDa are defined as vectors of positive elements that represent parts of a whole and usually add to a constant sum. Classical T 2 control chart is not appropriate for CoDa, for which is bett...
متن کاملSpatial modelling of zonality elements based on compositional nature of geochemical data using geostatistical approach: a case study of Baghqloom area, Iran
Due to the existence of a constant sum of constraints, the geochemical data is presented as the compositional data that has a closed number system. A closed number system is a dataset that includes several variables. The summation value of variables is constant, being equal to one. By calculating the correlation coefficient of a closed number system and comparing it with an open number system, ...
متن کاملPhytoplankton composition in shallow water ecosystems: influence of environmental gradients and nutrient availability
Environmental gradients caused by hydrological changes, whether natural or maninduced, affect the planktonic taxonomic and functional composition in shallow water ecosystems. In this sense, our aim was to find out the main variables or variable ratios that are the driving forces of the major phytoplankton taxonomic groups in Mediterranean coastal lagoons. For this purpose, 11 waterbodies were c...
متن کاملOptimality Theoretic Account of Acquisition of Consonant Clusters of English Syllables by Persian EFL Learners*
This study accounts for the acquisition of the consonant clusters of English syllable structures both in onset and coda positions by Persian EFL learners. Persian syllable structure is "CV(CC)", composed of one consonant at the initial position and two optional consonants at the final position; whereas English syllable structure is "(CCC)V(CCCC)". Therefore, Persian EFL learners need to resolve...
متن کاملData on fatty acid profiles of green Spanish-style Gordal table olives studied by compositional analysis
This article contains processed data related to the research published in "Tentative application of compositional data analysis to fatty acid profiles of green Spanish-style Gordal table olives" (Garrido-Fernández et al., 2018) [1]. It provides information on the implementation of compositional data analysis (CoDa) to the fatty acid profiles of Spanish-style Gordal table olives vs the use of co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011